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Background and prior art 

Various multi-computing architectures are known from the prior art where a 
plurality of processing units is coupled to form a cluster. Such architectures are 
used in parallel processing and also in the emerging field of blade computing. 

5 Blade computing relies on blade servers, which are modular, single-board 
computers. An overview of blade computing is given in "Architectures and 
Infrastructure for Blade Computing", September 2002, Sun microsystems and 
"THE NEXT WAVE: BLADE SERVER COMPUTING", Sun Microsystems 
(www.sun.com/servers/entrv/blade ), 

10 A content load balancing blade is commercially available from Sun 
microsystems ("Sun Fire TM B10n). This blade provides traffic and content 
management functionalities. Content load balancing is achieved based on 
URLs, CGI scripts and cookies; server load balancing is achieved based on 
server loads, response times, and weighted round-robin algorithms. 

15 US patent application no. 20030105903 shows a web edge server, which 
comprises a number of blade servers. A switch and an information distribution 
module are provided for the purpose of balancing. The information distribution 
module receives an information message, performs processing on the message 
to determine a destination, and forwards a message toward the determined 

20 destination via an internal communications network. 

Summary of the invention 

The present invention provides for a method of assigning objects to processing 
units of a cluster of processing units. Each one of the processing units has a 
certain storage capacity and load capacity. The storage capacity of a 
25 processing unit determines the maximum aggregated size of objects that can be 
stored by the processing unit. The load capacity of the processing unit 
determines the maximum processing load that the processing unit can handle. 
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For example, the load capacity of a processing unit can be indicative of the 
maximum number of access operations the processing unit can provide. Access 
operations may comprise both read accesses (select) and write accesses 
(update, insert, delete) to objects stored on the processing unit. For example 
the load capacity can be expressed as the maximum number of access 
operations per time unit the processing units can handle. 

In order to make maximum usage of the available data processing capacity 
provided by the cluster of processing units the distribution of the objects over 
the processing units needs to be balanced. This is accomplished by calculating 
an Index for each object based on object size and object load. For example, the 
object load is expressed as the expected mean number of access operations 
per time unit to the object The objects are sorted by index in order to provide a 
sequence. 

In the following it assumed without restriction of generality that the sequence is 
sorted in descending order. In this instance the procedure for assigning of 
objects to processing units starts with the first object of the sequence. One or 
more of the objects of the sequence are assigned to one processing unit in 
sequential order until a remaining storage capacity and/or a remaining load 
capacity of that processing unit is too small for consecutive objects of the 
sequence. When this condition is fulfilled, the procedure is carried out for the 
next processing unit, whereby the objects that have been previously assigned to 
the preceding processing unit are deleted from the sequence. This way a 
minimum number of processing units that are required for handling a given set 
of objects can be determined. 

In accordance with a preferred embodiment of the invention the remaining 
storage capacity of a processing unit is determined by the difference between 
the storage capacity of the unit and the aggregated size of the objects that have 
been assigned to the processing unit. Likewise the remaining load capacity of a 
processing unit is determined by the difference between the load capacity of the 
unit and the aggregated loads of objects that have been assigned to the 
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processing unit. On the basis of these definitions of the remaining storage 
capacity and of the remaining load capacity the minimum number of processing 
units is determined. 

In accordance with a further preferred embodiment of the invention the 
5 balancing procedure is performed again in order to further improve the quality of 
the balancing. For this purpose the largest gap between the aggregated sizes of 
objects being assigned to one of the processing units and the largest gap 
between the aggregated loads of objects being assigned to one of the 
processing units and the load capacity are determined. 

10 The size gap is divided by the minimum number of processing units and the 
result of the division is subtracted from the maximum storage capacity to 
provide a size threshold level. Likewise, the load gap is divided by the number 
of processing units and the result of the division is subtracted from the load 
capacity to provide a load threshold level. When the procedure for assigning the 

15 objects to the processing units is performed again, the definition of the 
remaining storage capacity is the difference between the aggregated size of 
objects being assigned to the processing unit and the size threshold level 
whereas the definition of the remaining load capacity is the difference between 
the aggregated load of the objects being assigned to the processing unit and 

20 the load threshold level. As a result of the renewed performance of the 
assignment procedure, the gap can be substantially reduced. 

In accordance with a further preferred embodiment of the invention the 
theoretical storage capacity limit is used as a size threshold. This size threshold 
is obtained by calculating the difference between the total of the storage 
25 capacities of the processing units and the total of the sizes of the objects and 
dividing the difference by the minimum number of processing units. The result 
of the division is subtracted from the storage capacity which provides the 
theoretical limit. 

Likewise the theoretical load capacity limit is used as a load threshold. This load 
30 threshold is obtained by calculating the difference between the total of the load 
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capacities of the processing units and the total of the loads of the objects and 
dividing the difference by the minimum number of processing units. The result 
of the division is subtracted from the load capacity which provides the 
theoretical load capacity limit. 

5 On this basis the assignment procedure is performed again whereby the 
remaining storage capacity is defined as the difference between the aggregated 
size of the objects of the processing unit and the size threshold whereas the 
remaining load capacity is defined a the difference between the aggregated 
load of the objects of the processing units and the load threshold. Typically it 
10 will not be possible to assign all of the objects to the minimum number of 
processing units on this basis. If this is the case one or more iterations are 
performed. 

For one iteration an excess amount of memory is divided by the minimum 
number of processing units. The result of the division is added to the size 

15 threshold. Likewise an excess load is divided by the minimum number of 
processing units. The result of the division is added to the load threshold. On 
the basis of the incremented size threshold and/or load threshold the assigment 
procedure is performed again. This process continues until all objects have 
been assigned to the minimum number of processing units. This way the quality 

20 of the balancing is further improved. 

In according with a further preferred embodiment of the invention the size 
threshold for performing the assignment procedure is varied between the 
theoretical storage capacity limit and the actual storage capacity. Likewise the 
load threshold is varied between the theoretical load capacity limit and the 
25 actual load capacity. Preferably a new assignment procedure is performed for 
each permutation of the size threshold/load threshold that can be thus obtained. 
For each of the resulting assignments of objects to processing units a statistical 
measure is calculated. This statistical measure is a basis to select one of the 
assignments for optimal balancing. 
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In according with a further preferred embodiment of the invention the standard 
deviation or variance of the sum of the indices of objects assigned to a 
processing unit is used as a statistical measure. The assignment having the 
lowest overall quality measure is selected. 

5 in accordance with a preferred embodiment of the invention the object sizes 
and object loads are normalised for the calculation of the indices. Preferably an 
index of an object is calculated on the basis of the sum of the normalised object 
size and normalised object load and the absolute difference of the normalised 
object size and normalised object load. Preferably the index is obtained by 

10 calculating a linear combination of the sum of the normalised object size and 
normalised object load and the absolute value of the difference of the 
normalised object size and normalised object load. 

In accordance with a preferred embodiment of the invention each one of the 
processing units is a blade or a blade server. One of the blades can have a 
15 program that implements the principles of the present invention in order to 
perform balancing. This way the number of swap-operations between the 
blades can be minimised. 

In accordance with a preferred embodiment of the invention the principles of the 
invention are implemented in an application program running on a personal 

20 computer. The application program is provided with a list of objects and the 
estimated sizes and loads of the objects that need to be handled by the cluster 
of processing units. On the basis of the object sizes and the object loads the 
minimum number of processing units that are required for the processing task 
are determined. This information can form the basis for a corresponding 

25 investment decision of a customer. 

It is to be noted that the present invention is not restricted to a particular type of 
object. For example, data objects such as tables, arrays, lists and trees are 
distributed to processing units, e.g. blades, in accordance with the principles of 
the present invention. For example, each one of the processing units runs a 
30 data processing task to which the respective objects are assigned. 
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Brief description of the drawings 



In the following preferred embodiments of the invention will be described in 
greater detail by way of example only, by making reference to the drawings in 
which : 



Figure 1 
Figure 2 

10 Figures 
Figure 4 
Figure 5 

Figure 6 
15 Figure 7 
Figure 8 
Figure 9 



Figures 10 
20 to 14 

Figures 15 
and 16 



Figure 1 7 



25 



is a schematic block diagram of a modular computer system having 
a cluster of blades, 

is illustrative of a flow diagram for assigning of objects to blades 
and for determining the minimum number of blades, 

is an example for tables that need to be assigned to blades, 

shows the normalised size distribution of the tables of figure 3, 

shows the loads of the tables in terms of the mean number of read 
accesses per second, 

shows the normalised loads, 

shows the normalised sizes and loads per table, 

shows a sorted sequence of the tables, 

shows the sorted sequence of the tables illustrating the normalised 
sizes and loads of each table, 

show the assignment of tables to blade 1 of the cluster of blades 
show the assignment of tables to blade 2, 

shows the resulting assignment of tables to blades that is obtained 
on the basis of the method of figure 2, 
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Figure 18 is illustrative of a preferred embodiment of the invention where the 
procedure of figure 2 is performed again with lower size and load 
thresholds, 

Figure 19 is illustrative of a preferred embodiment of a method of the 
invention where the load and size thresholds are varied iteratively, 

Figure 20 is illustrative of a further preferred embodiment of the invention 
where the size and load thresholds are varied in predetermined 
steps, 

Figure 21 is illustrative of a computer system performing the assignment of 
objects to blades. 

Detailed description 

Figure 1 shows cluster 100 of blades B1, B2, B3, ... BN. Each one of the blades 
has processor 102 and memory 104. In the example considered here, all 
memories 104 have the same storage capacity and all blades have the same 
load capacity. The blades are coupled by a network 106, such as a bus system. 
The number N of blades of cluster 100 needs to be chosen such that a given 
number of M objects of varying sizes and loads can be handled. 

For.exampie, cluster 100 implements a so-called search engine. In this instance 
identical search processes run on each one of the blades. The assignment of 
data objects, such as index tables, to blades can be stored in a dispatcher unit 
(not shown on the drawing) of cluster 100. This way data objects are assigned 
to blades and data processing tasks running on the blades. 

Figure 2 shows a procedure for assigning the objects to blades and to thereby 
determine the minimum value for N. 

In step 200 an sorting index is calculated for each one of the M objects. An 
sorting index of an object is indicative of the amount of blade resources the 
object requires. The sorting index serves to sort the objects in decreasing order 
of blade resource requirements. 
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For example the sorting index is calculated on the basis of the sum of the 
normalised object load and normalised object size plus the absolute value of the 
difference of the normalised load and size or a linear combination thereof. 

In step 201 a sorting operation is performed in order to sort the M objects by 
5 sorting index. The corresponding object sequence is provided in step 202. In 
step 204 the index i for the blades is initialised to 1 . 

In step 206 processing of the object sequence starts with the first object of the 
sequence, i.e. the object having the largest sorting index value. The first object 
of the sequence is assigned to a first one of the blades, i.e. blade B1, in step 
10 206. In step 208 the first object that has been assigned to blade B1 is deleted 
from the sequence. 

In step 210 the sizes of the objects that have already been assigned to blade 
B1 are summed up in order to provide an aggregated object size of blades B1. 
Next the size of a gap G s between the aggregated object size of blade B1 and a 
J 5 size threshold Ts is calculated. When the assignment procedure of figure 2 is 
carried out the first time, the size threshold Ts is the storage capacity of one of 
the blades. 

In step 211 the loads of the objects that have already been assigned, to blade 
B1 are summed up in order to provide an aggregated load of blade B1. Next a 
20 gap G L between the aggregated object loads of blade B1 and a load threshold 
T L is calculated. When the assignment procedure of figure 2 is carried out the 
first time, the load threshold T L is the load capacity of one of the blades. 

In step 212 it is determined whether there is a next object in the ordered 
sequence that fits into both gaps G s and G L . In other words, a consecutive 
25 object following the first object in the object sequence that has an object size 
small enough to fit into gap G s and at the same time has an object load that is 
small enough to fit into Gl is searched. 
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The next consecutive object in the sequence that fulfils this condition is 
assigned to blade B1 in step 214 and deleted from the sequence in step 216 
before the control goes back to step 210. 

If there is no such object that fulfils the condition of step 212, step 218 is carried 
5 out In step 218 it is determined whether all objects have already been assigned 
to blades. In other words, in step 218 it is checked whether the sequence is 
empty. If this is not the case the index i is incremented in step 220 and the 
control goes back to step 206 in order to assign remaining objects of the 
sequence to the next blade B2. 

10 If the contrary is the case the index i is the minimum number N of blades that 
are required to handle the M objects, i.e. i =N. This number is output in step 
220. The minimum number N of blades that are required to handle the M 
objects can be a basis for an investment decision for purchasing of a 
corresponding number of blades. The assignment of objects to blades is output 

15 in step 224 in order to visualise the quality of the object size balancing. 

Figure 3 shows an example of objects to be assigned to blades. In the example 
considered here the objects are a number of 20 different tables having various 
sizes between 50MB and 3,566MB as indicated in figure 3. For example, table 1 
has a size of 3,250MB, table 2 has 250MB, table 3 has 750MB, etc. The table 
20 sizes can be actual table sizes or average table sizes that have been obtained 
by monitoring a real-life data processing system. Alternatively the table sizes 
are estimates for the purpose of planning cluster 100 (cf. figure 1 ). 

Figure 4 shows the normalised table sizes of the tables of figure 3. The 
normalised size of a table is obtained by dividing the table size by the maximum 
25 storage capacity of one of the blades. 

Figure 5 shows the loads of the tables. In the preferred embodiment considered 
here the mean number of access operations (selects) per second that are 
performed or expected to be performed with respect to a table is used as a load 
indicator. In the example considered here the load as expressed by the number 
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of read accesses per second to a table varies between five read accesses per 
second (table 6) and 950 read accesses per second (table 8). The table loads 
given in figure 5 can be actual table loads or average table loads that have 
been obtained by monitoring a real-life data processing system. Alternatively 
the table loads are estimates or simulation results for the purpose of planning 
cluster 100. 

Figure 6 shows the normalised table loads. The normalised table load of a given 
table is obtained by dividing the number of read accesses per second of that 
table by the load capacity of one of the blades. 

In the preferred embodiment considered here the storage capacity of a blade is 
4,096MB. Hence a normalised table size of one indicates that the table has the 
absolute maximum size that can be handled by a given blade hardware. 

The load capacity of a blade is the maximum possible access load that can be 
handled by a core engine running on one of the blades in the example 
considered here. This maximum value can be determined by benchmarks, by 
experiment or simulation. The load capacity depends on various parameters 
such as hardware and software characteristics and network bandwidth if a 
network is used to perform the table accesses. In the preferred embodiment 
considered here, the load capacity of one of the blades is 1,000 read accesses 
per second. For the purpose of explanation only read accesses are considered 
here. However, other typical data processing tasks, such as accesses that 
involve changes to the data, can also be taken into consideration for 
determining load capacity and table loads. 

Figure 7 shows a diagram illustrating the various normalised table sizes and 
table loads. As apparent from figure 7 there are tables that require only a 
minimum amount of blade resources, especially table 13, that has a small table 
. size and a low table load. On the other hand there are tables that require a 
large amount of blade resources as they have both large table size and high 
table load. The distribution of tables to blades aims to balance the usage of the 
respective blade resources with respect to both size and load. The two 
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constraints, i.e. size and load, are combined into a common sorting index that 
provides an objective measure for the amount of blade resources a given table 
requires. 

The following definition of the sorting index is used for the purposes of 
5 explanation only and without restriction of generality: 

Sorting index = W1 *(size + load) + W2 *absolute value (size - load), 

where size is the table size, 

load is the table load 

W1 is a weighting factor for (size + load) and 

10 W2 is a weighting factor for the absolute value of the difference of size and 
load. 

For the purposes of the following example the weighting factors W1 and W2 are 
set to one without restriction of generality. 

In this case the above expression evaluates as follows: 

15 If size > load: sorting index -2* size 

If size = load: sorting index = 2 * size = 2 * load 

If size < load: sorting index = 2 * load. 

Figure 8 shows a sorted sequence in descending order of the sorting indices of 
the tables. The table that has the largest sorting index value and thus requires 
20 the largest amount of blade resources is table 8, whereas the table having the 
smallest sorting index value and which does require the lowest amount of blade 
resources is table 13 as apparent from figure 8. 

Figure 9 shows the sorted sequence of figure 8 illustrating the normalised table 
sizes and table loads of the individual tables. 
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Figure 10 illustrates the first iteration for assigning tables to blades based on the 
ordered sequence as shown in figure 8 and 9. It is assumed that each one of 
the blades has a storage capacity that corresponds to a normalised size 
threshold T s = 1 and a load capacity that corresponds to a normalised load 
threshold T L . 

The assignment procedure starts with the first table of the sorted sequence, i.e. 
table 8. Table 8 is assigned to blade 1 as illustrated in figure 10. This results in 
a normalised gap G s of unused storage capacity G s = 0,6948 and a gap G L of 
unused load capacity G L = 0,05. 

Next consecutive tables in the ordered sequence are searched that have table 
sizes and table loads that fit into the respective gaps G s and G L . These are 
tables 6 and 13. As. table 6 precedes table 13 in the ordered sequence, it is 
assigned to blade 1 as illustrated in figure 11. 

Next the aggregated table size and the aggregated table load of blade 1 is 
updated as illustrated in figure 12. This results in a updated normalised gap G s 
= 0,6362 and an updated normalised gap G L = 0,045 as illustrated in Fig. 
13.Now tables of the ordered sequence that are consecutive to table 8 that fit 
into both gaps G s and Gi_ are searched. 

The only table that fulfils both conditions is table 13 which is thus assigned to 
blade 1 as illustrated in figure 13. The result is shown in figure 14. The updated 
normalised gap G s is 0,6167 and the updated normalised gap G L is 0,03. No 
table within the ordered sequence satisfies both constraints. 

As there remain unassigned tables in the sequence an additional blade 2 is 
required as illustrated in figure 15. By definition the first table of the remaining 
ordered sequence, i.e. table 20 is assigned to blade 2. This results in 
normalised gaps G s = 0,1294 and G L = 0,5600. The only table that satisfies 
both constraints is table 12 which is thus moved onto blade 2 as illustrated in 
figure 15. 
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Figure 16 illustrates the result of this assignment. The updated normalised gap 
G s is 0,0024 and the updated normalised gap G L is 0,4500. None of the tables 
of the remaining ordered sequence satisfies these constraints such that an 
additional blade 3 is required etc. This procedure is carried out until the ordered 
5 sequence is empty, i.e. after all tables have been assigned to one blade. The 
resulting assignment of blades and the resulting minimum number of blades is 
illustrated in figure 17. In the example considered here the minimum number N 
of blades that is required for handling of alt of the tables is N = 10. 

In order to further improve the quality of the balancing the method of figure 18 
10 can be carried out on the basis of the result provided by the method of figure 2. 
In step 800 the largest remaining gaps, G s and G L are determined as illustrated 
in figure 17. In the example considered in figure 17 the largest remaining gap 
Gs is on blade 1 and the largest remaining gap G L is on blade 10, 

In step 802 the largest remaining gap G s is divided by N which yields delta 1 
15 and the largest remaining gap G L is divided by N which yields delta 2. 

In step 804 the size threshold T s is reduced by delta 1 and the load threshold T L 
is reduced by delta 2. In step 806 the method of figure 2 is performed again with 
the reduced thresholds as determined in step 804. In case the sorting indices 
and the original object sequence obtained from the method of figure 2 by 
20 performing steps 200 to 202 have been saved, these steps can be skipped in 
order to start processing of the method of figure 2 in step 204. The result is a 
further improved balancing. 

Figure 19 shows an alternative approach for refining the balancing. In step 900 
delta 3 is calculated by calculating the difference of the sum of the storage 
25 capacity of the blades and the sum of the object sizes of all objects to be 
assigned to the blades and by dividing the difference by the minimum number of 
blades N. In step 902 delta 4 is calculated by calculating the difference of the 
sum of the load capacities of the blades and the sum of the object loads of all 
objects to be assigned to the blades and by dividing the difference by the 
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minimum number of blades N..ln the example considered here no normalized 
values are used in steps 900 and 902. 

In step 904 the size threshold T s is reduced by the normalised value of delta 3. 
The normalised value of delta 3 is obtained by dividing delta 3 by the storage 
capacity of one of the blades. 

Likewise the load threshold T L Is updated In step 906 by the normalised delta 4. 
Normalisation of delta 4 is performed by dividing delta 4 by the load capacity of 
one of the blades. 

The reduced size threshold T s and the reduced load threshold T L correspond to 
the theoretical limit of blade resources that are required for handling of the given 
objects. As the object granularity is finite the theoretical threshold limits will be 
surpassed in most cases: 

In order to refine the balancing the method of figure 2 is performed again (step 
908) on the basis of the size threshold T s obtained from step 904 and the 
reduced load threshold T u obtained from step 906 with the modification that the 
number of blades for the renewed performance of the method of figure 2 is 
limited by the minimum number of blades N that has been determined 
previously. In other words, the assignment procedure of figure 2 stops when the 
minimum number N of blades has been used up; this may have the 
consequence that there will be an excess amount of memory and/or load 
requirement for the last blade of the minimum number N of blades considered 
for the assignment procedure that surpasses the respective thresholds T s 
and/or T L . Again steps 200 to 202 of the method of figure 2 can be skipped in 
case the sorting indices and the sorted sequence has been stored when the 
method of figure 2 was performed the first time. In step 910 it is determined 
whether there is such an excess amount of memory requirement and/or load 
requirement for the last blade N, i.e. blade 10. If this is not the case the resulting 
assignment of objects to blades is output in step 912. 
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If the contrary is the case step 914 is carried out in order to calculate the values 
of delta 5 and/or delta 6. Delta 5 is obtained by dividing the excess amount of 
memory, if any, by the number of blades N. Likewise delta 6 is obtained by 
dividing the excess load requirement, if any, by the number of blades N. 

5 On this basis the size and/load thresholds are incremented in step 916. From 
there the control goes back to step 908. 

Steps 908 to 916 are carried out repeatedly until there is no longer an excess 
amount of memory and/or load requirement that cannot be provided by the 
given number N of blades. 

1 0 Figure 20 shows a further alternative for refinement of the balancing. First steps 
900 and 902 of the method of figure 1 9 are carried out in order to obtain delta 3 
and delta 4. The gaps delta3 and delta4 are now scanned by stepwise variation 
of the respective size and load thresholds in order to identify a more balanced 
assignment of objects to blades. For this purpose delta 3 and delta 4 are 

15 divided into equidistant intervals having step sizes delta 7 and delta 8, 
respectively, as shown in step 1000 The number of increments for performing 
the respective scans can be predefined or is user selectable. 

It is to be noted that the number of increments for scanning Gs and for scanning 
G L does not need to be the same. Preferably the scans are performed 
20 independently from each other such that the total number of assignments that is 
considered is the number of increments for the Gs scan multiplied by the 
number of increments for the Gl scan. 

In step 1002 the size and load thresholds are set to the respective theoretical 
minima that are required to provide sufficient blade resources for handling of the 
25 given number of objects. On this basis the method of figure 2 is performed 
again in step 1004. In step 1006 a statistical measure is calculated as a quality 
measure for the balancing of the assignment of objects to blades obtained as a 
result of step 1006, if any. For example, the standard deviation of the 
aggregated sorting index values of objects assigned to each one of the blades 
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is calculated. In other words, for each blade the total of the sorting index values 
of the objects that have been assigned to the blade is calculated. This provides 
one total sorting index value per blade. Next the standard deviation is calculated 
for the total sorting index values. 

5 In step 1008 at least one of the thresholds Ts or T L is incremented by the 
normalised value of delta 7 or the normalised value of delta 8, respectively. 
Next step 1004 is carried out again on the basis of the incremented size and 
load thresholds. Steps 1004 to 1008 are earned out repeatedly until the 
respective scans through G s and G L have been completed and the 
10 corresponding assignments of objects to blades have been obtained. In step 
1010 one of the assignments is selected based on the statistical measures. For 
example, the assignment having the lowest standard deviation is selected. 

It is to be noted that this procedure is limited by the minimum number of blades 
N. For assignments that do not fit on this given minimum number of blades N no 
15 statistical measure needs to be calculated as these assignments are not 
considered further in the procedure. 

Figure 21 shows a computer 108 that has processor 110 for running program 
112. Program 112 has module 113 for calculating a sorting index value for 
each object to be assigned to a blade. Further program 112 has module 1 14 for 
20 sorting of the objects by sorting index value and module 116 for assigning of 
objects to blades. 

Further computer 108 has storage 118 for storing a table listing the objects, 
object sizes, and object loads of objects to be assigned to blades, storage 120 
for storage of a storage capacity value of the blades, storage 121 for storage of 
25 a load capacity value of the blades, and storage 1 22 for storing of the number of 
blades. Further computer 108 has interface 124 for coupling to workstation 126. 

In operation the table with the object names/numbers, object sizes and object 
loads is entered via interface 124 and stored in storage 118. Further a storage 
capacity value for the storage capacity of each individual blade is entered via 
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interface 124 and stored in storage 120. Likewise a load capacity value for the 
load capacity of each individual blade is entered via interface 124 and stored in 
storage 121. 

Next program 112 is invoked. Program 112 calculates a sorting index for each 
5 object contained in the table stored in storage 118 on the basis of the 
normalised object size and object load. The resulting indices are entered into 
the table stored in storage 118 by module 113. Next module 114 sorts the table 
of storage 118 by decreasing storage index to provide a sequence of objects 
(cf. the sequence of figure 8). Next module 116 performs the method of figure 2 
10 in order to determine the minimum number of required blades. 

This minimum number is stored in storage 122 and is output via user interface 
124. This number can be a basis for a users investment decision for purchasing 
the number of blades to realise the data processing system being capable of 
handling the objects as listed in the table. 

15 In addition, module 116 can perform the methods of figure 18, 19 and/or 20 for 
refining of the balancing. 

Alternatively, computer 108 is one of the blades. In this instance computer 108 
can dynamically change the assignment of objects to blades when the object 
size changes. This way frequent swapping operations can be avoided. In other 
20 words, the creation of "hot spots" is prevented by balancing the load. 
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List of Reference Numerals 



1 00 cluster 

5 102 processor 

104 memory 

106 network 

108 computer 

110 processor 

10 112 Program 

113 module 

114 module 
116 module 
118 storage 

15 120 storage 

121 storage 

122 storage 

1 24 interface 

126 workstation 

20 
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CLAIMS 



A method of assigning objects to processing units of a cluster of 
processing units, each one of the objects having an object size and an 
object load, each one of the processing units having a storage capacity 
and a load capacity, the method comprising the steps of: 

- a) calculating an index based on object size and object load for 
each one of the objects, 

- b) sorting of the objects by index to provide a sequence of objects; 

- c) for each processing unit of the cluster: 

assigning of one or more of the objects to the 
processing unit in sequential order until a remaining 
storage capacity and/or a remaining load capacity of the 
processing unit is too small for consecutive objects of 
the sequence; 

deleting of the objects that are assigned to the 
processing unit from the sequence. 

The method of claim 1 , whereby step 1 c) is carried out repeatedly until the 
sequence is empty in order to provide a minimum number of the 
processing units. 

The method of claim 1 or 2, whereby the remaining storage capacity is 
determined by the difference between the storage capacity and the 
aggregated size of objects being assigned to the processing unit. 

The method of claim 1, 2 or 3, whereby the remaining load capacity is 
determined by the difference between the load capacity and the 
aggregated loads of objects being assigned to the processing unit. 
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The method of claim 3 or 4, further comprising the steps of: 

- d) determining a first largest gap between the aggregated size of 
objects being assigned to one of the processing units and the 
storage capacity, 

- e) determining a second largest gap between the aggregated load 
of objects being assigned to one of the processing units and the 
load capacity, 

- f) subtracting the first largest gap divided by the number of 
processing units from the storage capacity to provide a first 
threshold, 

- g) subtracting the second largest gap divided by the number of 
processing units from the load capacity to provide a second 
threshold, 

- h) performing step 1 c) again, whereby the remaining storage 
capacity is the difference between the aggregated size of the 
objects being assigned to the processing unit and the first 
threshold, and whereby the remaining load capacity is the 
difference between the aggregated load of the objects being 
assigned to the processing unit and the second threshold. 

The method of any one of the preceding claims 1 to 5 further comprises 
the steps of: 

• d) determining the total of the sizes of the objects, 

- e) determining the total of the loads of the objects, 

- f) determining a first difference between the total of the storage 
capacities of the minimum number of processing units and the 
total of the sizes of the objects, 
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- g) determining a second difference between the total of the load 
capacities of the minimum number of processing units and the 
total of the load of the objects, 

- h) subtracting the first difference divided by the minimum number 
5 of processing units from the storage capacity to provide a first 

threshold, 

i) subtracting the second difference divided by the minimum 
number of processing units from the load capacity to provide a 
second threshold, 

j) performing step 1 c) again, whereby the remaining storage 
capacity is determined by the difference between the aggregated 
size of the objects being assigned to the processing unit and the 
first threshold, and whereby the remaining load capacity is 
determined by the difference between the aggregated load of the 
objects being assigned to the processing unit and the second 
threshold, 

- k) in case that as a result of step 6 j) there is an excess amount of 
memory requirement for one of the processing units that 
surpasses the first threshold, dividing the excess amount by the 

20 minimum number of processing units and increasing the first 

threshold by the result of the division. 

- I) in case that as result of step 6 j) there is an excess load 
requirement for one of the processing units that surpasses the 
second threshold, dividing the excess load by the minimum 

25 number of processing units and increasing the second threshold 

by the result of the division, 

whereby steps 6 j), 6 k) and 6 I) are performed repeatedly until there 
is no such excess amount of memory requirement and no such 
excess load requirement. 



10 



15 
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7. The method of any one of the preceding claims 1 to 6, further comprising 
the steps of: 

- d) stepwise varying the first and second thresholds between 
respective first and second limits, 

- e) performing step 1 c) for each first and second threshold value, 
whereby the remaining storage capacity is the difference between 
the aggregated size of the objects being assigned to the 
processing unit and the first threshold, and whereby the remaining 
load capacity is the difference between the aggregated load of the 
objects being assigned to the processing unit and the second 
threshold, and whereby a statistical measure is calculated for the 
assignment of objects to the processing unit, 

- f) selecting one of the assignments of objects to processing units 
based on the statistical measure. 

3. The method of claim 7, whereby the first limit of the first threshold is given 
by the aggregated size of the objects divided by the minimum number of 
processing units, and whereby the second limit of the first threshold is 
given by the storage capacity, and whereby the first limit of the second 
threshold is given by the aggregated load of the objects divided by the 
minimum number of processing units, and whereby the second limit of the 
second threshold is given by the load capacity. 

). The method of claims 7 or 8, whereby the statistical measure is calculated 
by calculation a standard deviation or a variance of the totals of the indices 
of objects assigned to one processing unit. 

0. The method of any one of the preceding claims 1 to 9, whereby the 
objects are database tables of various sizes. 

1 . The method of any one of the preceding claims 1 to 10, whereby each one 
of the processing units is a blade or a blade server. 
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1 2. The method of any one of the preceding claims 1 to 1 1 , whereby the index 
of an object is calculated based on the sum of the normalised object size 
and object load and based on the absolute value of a difference between 
the normalised object size and the normalised object load. 

13. A computer program product for assigning objects to processing units of a 
cluster of processing units, each one of the objects having an object size 
and an object load, each one of the processing units having a storage 
capacity and a load capacity, the computer program product comprising 
instructions for: 

- a) calculating an index based on object size and object load for 
each one of the objects, 

- b) sorting of the objects by index to provide a sequence of objects; 

- c) for each processing unit of the cluster: 

assigning of one or more of the objects to the 
processing unit in sequential order until a remaining 
storage capacity and/or a remaining load capacity of the 
processing unit is too small for consecutive objects of 
the sequence; 

deleting of the objects that are assigned to the 
processing unit from the sequence. 

14. The computer program product of claim 13 the instructions being adapted 
to repeatedly carry out step 13 c) until the sequence is empty and to 
output a minimum number of the processing units that are required for the 
objects. 

15. The computer program product of claim 13 or 14, the instructions being 
adapted to perform the steps of: 
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- d) determining a first largest gap between the aggregated size of 
objects being assigned to one of the processing units and the 
storage capacity, 

- e) determining a second largest gap between the aggregated load 
of objects being assigned to one of the processing units and the 
load capacity, 

- 0 subtracting the first largest gap divided by the number of 
processing units from the storage capacity to provide a first 
threshold, 

- g) subtracting the second largest gap divided by the number of 
processing units from the load capacity to provide a second 
threshold, 

- h) performing step 13 c) again, whereby the remaining storage 
capacity is the difference between the aggregated size of the 
objects being assigned to the processing unit and the first 
threshold, and whereby the remaining load capacity is the 
difference between the aggregated load of the objects being 
assigned to the processing unit and the second threshold. 

16. The computer program product of claims 13, 14, or 15 the instructions 
being adapted to perform the steps of: 

- d)determining the total of the sizes of the objects, 

- e) determining the total of the loads of the objects, 

- f) determining a first difference between the total of the storage 
capacities of the minimum number of processing units and the 
total of the sizes of the objects, 
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- g) determining a second difference between the total of the load 
capacities of the minimum number of processing units and the 
total of the load of the objects, 

- h) subtracting the first difference divided by the minimum number 
of processing units from the storage capacity to provide a first 
threshold, 

_ i) subtracting the second difference divided by the minimum 
number of processing units from the load capacity to provide a 
second threshold, 

- j) performing step 13 c) again, whereby the remaining storage 
capacity is determined by the difference between the aggregated 
size of the objects being assigned to the processing unit and the 
first threshold, and whereby the remaining load capacity is 
determined by the difference between the aggregated load of the 
objects being assigned to the processing unit and the second 
threshold, 

- k) in case that as a result of step 16 j) there is an excess amount 
of memory requirement for one of the processing units that 
surpasses the first threshold, dividing the excess amount by the 
minimum number of processing units and increasing the first 
threshold by the result of the division. 

- I) in case that as result of step 1 6 j) there is an excess load 
requirement for one of the processing units that surpasses the 
second threshold, dividing the excess load by the minimum 
number of processing units and increasing the second threshold 
by the result of the division, 

whereby steps 16 j), 16 k) and 16 I) are performed repeatedly until there is 
no such excess amount of memory requirement and no such excess load 
requirement. 
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17. The computer program product of any one of the preceding claims 13 to 

16, the instructions being adapted to perform the steps of: 

- d) stepwise varying the first and second thresholds between 
respective first and second limits, 

- e) performing step 13 c) for each first and second threshold value, 
whereby the remaining storage capacity is the difference between 
the aggregated size of the objects being assigned to the 
processing unit and the first threshold, and whereby the remaining 
load capacity is the difference between the aggregated load of the 
objects being assigned to the processing unit and the second 
threshold, and whereby a statistical measure is calculated for the 
assignment of objects to the processing unit, 

- f) selecting one of the assignments of objects to processing units 
based on the statistical measure. 

18. The computer program product of any one of the preceding claims 13 to 

17, the instructions being adapted to calculate the index of an object on 
the basis of the sum of the normalised object size and normalised object 
load and on the basis of the absolute value of the difference of normalised 
object size and normalised object load. 

19. A data processing system for determining a minimum number of 
processing units of a cluster of processing units for a given number of 
objects having various object sizes and object loads, the data processing 
system comprising: 

- means for calculating an index based on object size and object 
load for each one of the objects, 

- means for assigning of one or more of the objects to a processing 
unit in sequential order until a remaining storage capacity and/or a 
remaining load capacity of the processing unit is too small for 
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consecutive objects of the sequence and for deleting of the 
objects that are assigned to the processing unit from the 
sequence. 

- means for outputting of the minimum number of the processing 
5 units. 

20. A blade server having balancing means for dynamically assigning objects 
to a plurality of blade servers, each one of the objects having an assigned 
index that is based on object size and object load, the balancing means 
being adapted to assign objects to the blade servers by the steps of: 

10 - a) sorting of the objects by index to provide a sequence of objects; 

- b) for each processing unit of the cluster: 

assigning of one or more of the objects to the 
processing unit in sequential order until a remaining 
storage capacity and/or a remaining load capacity of the 
15 processing unit is too small for consecutive objects of 

the sequence; 

deleting of the objects that are assigned to the 
processing unit from the sequence. 



20 
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Abstract 

A method of assigning objects to processing units 



The present invention relates to a method of assigning objects to processing 
units of a cluster of processing units, each one of the objects having an object 
size and an object load, each one of the processing units having a storage 
capacity and a load capacity, the method comprising the steps of: 

10 - a) calculating an index based on object size and object load for 

each one of the objects, 

- b) sorting of the objects by index to provide a sequence of objects; 

- c) for each processing unit of the cluster: 

- * assigning of one or more of the objects to the 
15 processing unit in sequential order until a remaining 

storage capacity and/or a remaining load capacity of the 
processing unit is too small for consecutive objects of 
the sequence; 

deleting of the objects that are assigned to the 
20 processing unit from the sequence. 

(figure 2 ) 
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